PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Lsa000228
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Cichorioideae; Cichorieae; Lactucinae; Lactuca
Family HD-ZIP
Protein Properties Length: 839aa    MW: 92246.9 Da    PI: 5.1392
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
gnl|UG|Lsa#S58695800PU_refUnigeneView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.95.6e-21105160156
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                +++ +++t++q++e+e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  Lsa000228 105 KKRYHRHTAHQIQEMEALFKECPHPDDKRRMRLSQELGLKPRQVKFWFQNRRTQMK 160
                688999***********************************************998 PP

2START151.85.7e-483075322206
                HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEE CS
      START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddkeqWdetla....kaetlev 84 
                +a++ ++e+vk+  ++e +W+k+   + g+evl   + +k              +++ea ras vv+ ++++lv  +ld   +W e ++    +a+tl+v
  Lsa000228 307 FALSFMDEVVKMCRLGESLWTKVN--DSGKEVLNLDQHAKMfpcfisqkgdpneFRYEATRASSVVIINSVTLVDAFLDAD-KWMELFPsiisRAKTLQV 403
                577889******************..77777777666655577777788899999**************************.****************** PP

                ECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSX CS
      START  85 issg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrl 176
                ++sg      g lqlm+aelq+lsplvp R+  f+Ry+ q + +g+w+ivd  +ds  +   ++s+ R +++pSg++i++++ng+s vtw+eh++ ++  
  Lsa000228 404 VTSGvngnanGSLQLMYAELQMLSPLVPtREIHFLRYCAQnSDDGSWAIVDFPLDSFHETY-QASLTRYKRRPSGCIIQDMPNGYSSVTWIEHAEAEDEP 502
                ****************************************999********9999988877.9************************************* PP

                XHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
      START 177 phwllrslvksglaegaktwvatlqrqcek 206
                +h ++   v+sg+a+ga++w+a lqrqce+
  Lsa000228 503 VHGIFTDYVSSGMAFGARRWLAVLQRQCER 532
                ****************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.37E-2091163IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.2E-2299168IPR009057Homeodomain-like
PROSITE profilePS5007117.216102162IPR001356Homeobox domain
SMARTSM003891.4E-19103166IPR001356Homeobox domain
CDDcd000863.73E-19105163No hitNo description
PfamPF000461.6E-18105160IPR001356Homeobox domain
PROSITE patternPS000270137160IPR017970Homeobox, conserved site
PROSITE profilePS5084844.28297535IPR002913START domain
SuperFamilySSF559615.36E-31299534No hitNo description
CDDcd088758.71E-111301531No hitNo description
SMARTSM002341.0E-34306532IPR002913START domain
PfamPF018521.9E-39307532IPR002913START domain
Gene3DG3DSA:3.30.530.202.0E-4361493IPR023393START-like domain
SuperFamilySSF559611.03E-16550788No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 839 aa     Download sequence    Send to blast
KKFYITLQQR EGVHIRNYTD TGKMFGDCQV MSSMGGGSGI SSESIFPNFN FMPFTDFSFV  60
IPKEEMESMV RSGKEEMESG SGSEQIEGGV SGNEQDTDQQ PPAKKKRYHR HTAHQIQEME  120
ALFKECPHPD DKRRMRLSQE LGLKPRQVKF WFQNRRTQMK AQQDRADNAI LRNDNDNLRN  180
ENSRLQSALR ALVCSNCGGS TISFDEQQLR MENARLKEEL DRICLIATQF CGSDPMQGQG  240
HGPLSSLMTS NLDLDMNMYP RAYDQEGLPN CSEMLQMNPL MASEGPNFAA ANGLIIMDDE  300
KPLAIQFALS FMDEVVKMCR LGESLWTKVN DSGKEVLNLD QHAKMFPCFI SQKGDPNEFR  360
YEATRASSVV IINSVTLVDA FLDADKWMEL FPSIISRAKT LQVVTSGVNG NANGSLQLMY  420
AELQMLSPLV PTREIHFLRY CAQNSDDGSW AIVDFPLDSF HETYQASLTR YKRRPSGCII  480
QDMPNGYSSV TWIEHAEAED EPVHGIFTDY VSSGMAFGAR RWLAVLQRQC ERLASLMARN  540
ISDIGAISSP EARKNLMNLA QRMVRMFCLN ITGSCGQSWT ALSDSVEDTV RITTRKVTEP  600
GQPNGLILTA VSTTWLPHPH YQVFDLLRDE RRRSQLDVLS NGIPLQEVAH IANGSHPGNS  660
ISLLRLNVAS NSSQNVELVL QESCTDDSGS LVVYSVVDVD AIQHTMSGED PSCIPLLPLG  720
FVIVPAGQNP NDNITSTTTT TGNSTESGGG CLLTVGLQVL ANAIPTAKLS LSSANTVNNH  780
IQSTVQQIIA ALGGVAAGNG NSTGGGNSGS SDGDNVGGFD EPATRPIKKV DLSPTSKVN
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010663102.10.0PREDICTED: homeobox-leucine zipper protein ROC3
RefseqXP_010663098.10.0PREDICTED: homeobox-leucine zipper protein ROC3
RefseqXP_002277673.10.0PREDICTED: homeobox-leucine zipper protein ROC3
SwissprotA2ZAI70.0ROC3_ORYSI; Homeobox-leucine zipper protein ROC3
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLA0A103XQT70.0A0A103XQT7_CYNCS; Homeobox, conserved site-containing protein (Fragment)
STRINGVIT_02s0012g02030.t010.0(Vitis vinifera)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7